Goto

Collaborating Authors

 video compression


Neural B-frame Video Compression with Bi-directional Reference Harmonization

Neural Information Processing Systems

Neural video compression (NVC) has made significant progress in recent years, while neural B-frame video compression (NBVC) remains underexplored compared to P-frame compression. NBVC can adopt bi-directional reference frames for better compression performance. However, NBVC's hierarchical coding may complicate continuous temporal prediction, especially at some hierarchical levels with a large frame span, which could cause the contribution of the two reference frames to be unbalanced. To optimize reference information utilization, we propose a novel NBVC method, termed Bi-directional Reference Harmonization Video Compression (BRHVC), with the proposed Bi-directional Motion Converge (BMC) and Bi-directional Contextual Fusion (BCF).







General response (R1, R2, R3)

Neural Information Processing Systems

Dear Reviewers, we thank you for taking the time to provide valuable feedback. Below we address the main issues raised. Its performance depends on our ability to predict the distribution over future frames with low entropy. We will emphasize these aspects more in a revised version. RNNs to model dynamics in the latent space.